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© A recombinant DNA fragment (2) coding for an immunoglobulin G binding domain related to staphylococcal 
protein A, characterized in that the methionine codon of said fragment has been replaced by a codon of another 
amino acid residue enabling expression of a methioninefree protein; 
a recombinant DNA sequence comprising at least two such fragments: 

a recombinant DMA sequence comprising such fragment preceded by a signal sequence and followed by a 
nucleotide sequence coding for the amino acid sequence: 
Ala Gin His Asp Glu Ala; 

a recombinant DNA molecule comprising such recombinant DNA sequence and fused 3' thereof at DNA level a 
production gene, said molecule having the ability to express a fused protein; 

a process for cleaving a fused protein expressed In a biological system by a such recombinant DNA molecule; 
a plasmid vector comprising said recombinant DNA molecule; and 
a bacterial cell harbouring such recombinant DNA molecule. 
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■tlgn gf an jgG binding protein to facljjtate down^ am processlno Mslnn . nfntaln gngloeerlna 

The present Invention relates to a recombinant Dna fra«r«^# , 

(hereinafter called .gG, binding domain related to ^^^^1^1" ,m ™"°* Mn Q - 
such fragments and to a process tor cleavage of a ^sed^iin * S sequences comprising 

sequence. The invention also re.ates to piasmid vectors and TT? L^ 9 S ° Ch fra9mem or 
s DNA fragments or sequences. Basically, me pTesenUnven^ Ce " S harbourin 9 such "combine* 

and purifying foreign proteins expressed in bacteria. ^ * " " nprewd SyStem '° r produc,n 9 

exprets^ * — - transient 

staphylococcal protein A any gene product un be^ourifir^a* P roces sing. By making a gene fusion to 

ro purified in a s.ng.e step usinglgG aSty ^^chrom^ooraohv W. £ 'T^'" ^ and can thus be 

syntehtic gene encoding human TinsulinTke gXS^So, . S P , tk"^ 5*** Pr ° tein A 9806 ,0 3 
recover In high yield from the g"medrjnTof 1£>h JT°" " * P ' 0,ein eXPr " SSed C0U,d be 

gene product consisting of i^^S&n""* JT alS0 ^ ^ 3 
a method described in our Swedish pa^nt aDDlSi on 1ml ^ ^ w f s K ecre,ed ,rom «» EB* ce " by 

rs reference: filed simultaneously h^ewim) PP ' ,Ca,, ° n (th6 d ' SC,OSUre <* which is Incorporated herein by 

fcC^srr s^s 0 ^ r^r-: ,oo,s for e r^ - — *■ 

release a biologically active peptide or mZ' y^TZ' !! dependent °" Processing after purification to 
preference comparea to , wts^ctesE^Tw ^ Chavage m9tnods are in 

sequ^f^^^^ 'T^ * dip ° p « d ° 

dependent on If the amino acldfs) wi*Z fo L 21 J ^7?" SPeC, " C ,0r Met ^ Cholce of me,hod ls 
internal methionine and ^pThas Prot^ A nTZ T?* *~ Pf ° dUCt " n0t IGF -' has 
region and 5 Asn-Gly In the \FtE£££* of p^A^s Z^Z^T ^ 

domJn 9 sTLfrM^t^'XX^ ^1? ^ 5 ^ " ^ "* • 
palindromic A^cJ sites are preferablv •hSLSZS 1 1 T sequence. At the same time two non 

domain to encumber oHgG Sg Z£nT *" ,D b * 8b ' e to »• ^ binding 

immunoglobulin G ^6o^^^Z^^^T ^ "* diSC,OSUr0) COdi ° 9 ,or *" 
that the methionine codon of saiS ta^n52S^?lS2 T !^ "T^ be ' n9 «" 
enabling expression of a methionine^ ZSn ^!s pre^tnat th?^ ° TTT ^ 
residue Is that of leucine. preferred that the codon of said another amino acid 

been n re a plST Z^Z«£ !££^rl -P^^-ionlne codona have 

digestion of natfJe protein ^SZS^^SLST T ° N °" 1 * dofined * ^ 

the sequence GTAGTfu^sh^a^aS^c^ ,1? C ° d ° n - S ° " l ° ^ at ^ — « d « *-* 

~**n has been mplaced by an 

comprising * teasTt^ Z-fSg^nts ^10?^^ 1 reC0 ™ bln *« DNA sequence 

preferably within the range wSZ^^^ZZSZ* "** ama ' 9amatSd Z ' fr « ,s 

eocCS^liS roTB^L^^H 0 ^ ' S V***" ,0r a DNA fragment 

Asn^y coding con^eLL\ B as C b e:re^ * ^ - 9 ^ — » * - 

aboT^^bf: EfaeSent^r 1 ^ sedue ^ «-P^9 the 2-fragmen« as defined 
sequence: Ala Ih.s A^gLa^ * * mC ^ 6 S8quence codino «>< «» aoid 
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The invention also covers a recombinant DNA molecule comprising the recombinant DNA sequence as 
described above and fused 3' thereof at DNA level a pro duction gene. By this arrangement such molecule 
obtains the ability to express a fused protein In a suitable host. Such production gene may be that of a 
somatomedin, examples of which are: growth hormones or factors, such as hGH (human growth hormone). 

5 IGF-I. IGF-II, NGF (Nerve Growth Factor). EGF (Eplthermal Growth Factor) and POGF (Platelet Derived 
Growth Factor). The production gene may also be one coding for an interferon. interleuKin-2 Insulin, 
neuropeptide, gastrointestinal peptide etc. Specifically, the production gene is that of IGF-I or IGF-II. The 
production gene may also code for a structural gene for an enzyme or parts thereof. 

It is preferred that In such recombinant DNA molecule the N-terminal glycine codon is preceded by an 

10 asparagine codon to enable hydroxyl amino cleavage of the peptide bond to release the native protein, such 
as IGF-I. In another embodiment the N-terminal codon is preceded by a methionine codon to enable 
cyanogen bromide cleavage of the peptide bond to release native protein, such as IGF-II. 

According to still another aspect of the invention there is provided a process for cleaving a fused 
protein expressed In a biological system by the recombinant DNA molecule as defined above. Such 

is treatment Is suitably performed by hydroxyl amine treatment when the N-terminal glycine codon is 
preceded by an asparagine codon. When preceded by a methionine codon the cleavage is preferably 
performed by cyanogen bromide treatment. 

Finally, the invention covers a plasmid vector comprising the recombinant DNA molecule as described 
above. The invention also extends to bacterial cells harbouring the recombinant DNA-molecule defined 

20 above. The molecule can be harboured In the chromosome of the bacterial cell but may also be contained 
In a plasmid vector. 

The host cell is for example Gram negative and is particularly constituted by an E.coli. 
The invention will in the following be further illustrated by non-limiting examples with refefence to the 
appended drawings, wherein: 

Fig. 1 shows the organization of the coding region of the protein A gen. S is the signal sequence. A-E 
are the IgG binding domains and X Is the C-terminal region with no IgG binding activity of the encoding 
polypeptide; * 

Fig. 2 shows a comparison of the different IgG binding regions. The first line shows a suggested 
consensus amino acid sequence of the IgG binding regions. The boxes show the stretches of amino acids 
Involved In the two different alpha helixes. The amino acids involved in the binding to IgG are underlined. 
The amino acids In the different regions are shown by — for no change compared to the consensus codon. 
+ for no amino acid change but a silent mutation and the letter for another amino acid for amino acid 
changes. The amino acids are given in the one letter code: 

Fig. 3 shows the nucleotide sequence of the sense strand of the synthezised Z-fragment. The 
cleavage region is the stretch of amino acids needed for processing of the signal sequence. The 2-region is 
the part of the Z-fragment coding for.the IgG binding domain. The amino acid changes are underlined. The 
restriction enzyme recognition sequences for sites used in Examples are shown; 

Fig. 4 shows the nucleotide sequence of the ZZ-IGF-I encoded by the pZZ-IGF-l plasmid vector. The 
regions encoding the signal peptide, the cleavage region, the two Z-regions and IGF-I are shown as well as 
restriction sites relevant for the construction strategy; 

Fig. 5 shows the strategy described in Examples section V. The synthetic oligomers were cloned in 
M13 mp 18 (not shown in the figure) prior to the cloning of the Z-fragment (from Hinol III to Eco. Rl) into 
pUC8. By digesting pUCB-Z with AccJ the Z-region is cleaved out and by religation followed by transforma- 
tion the pUC8-ZZ plasmid vector could be isolated. AMP Is the gene coding for the ^-lactamase gene, ori is 
the origin of replication for g.ccJL lac T is the gene coding for /5-galactosidase alpha fragment and Z Is the 
synthetic fragment; 

Pig. 6 shows the cloning strategy described in Examples section VI. AMP is the gene coding for 0- 
lactamase, S is the signal sequence. A-E are the IgG binding domains of protein A, ori is the origin of 
replication. Z is the synthetic fragment, IGF-I is the gene for IGF-I, F1 is the origin of replication from phage 
60 tl and lacZ is the gene for l-galactosidase; 

Fig. 7 shows the construction of pASZ1 and pASZ2 as described In Examples sections lit and IV. 
AMP is the gene encoding the ^-lactamase. F1 is the origin of replication for phage h, S is the signal 
sequence. A-E are the IgG binding regions, ori is the origin of replication for E.cgfl and Z is the synthetic 
fragment; ~ 

63 Rg. 8 shows the strategy of the process to purify IGF-I using the method of unique Asn-Gly 

cleavage, as described in Examples section VII; and 
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Rg. 9 shows the SDS gel electrophoresis of the proteins corresponding to the 1 different steps in the 
process described in Examples section VII. Lane 1 shows size markers in Kilo Daltons, lane 2 shows the 
hydrld protein after IgG affinity purification, lane 3 shows the result of hydroxylamine cleavage, lane 4 
shows the flow through of an IgG sepharose gel of hydroxylamine digested hybridprotein and lane 5 Is pure 
5 IGF-I (marker). The bands corresponding to ZZ-IGF-I. 22 and IGF-I are shown by arrows. 

By cleaving the cloned synthetic fragment using the restriction enzyme Accl followed by relation of 
the isolated fragment, the fragments will ligate head to tail and create a tandem repeat of fragments and in 
theory any number of regions can be obtained. The fragment used to illustrate the Invention was 
constructed in the following way: 
io Protein A consists of two distinct regions: The IgG binding region (A-E domains) and region X having no 
IgG binding activity <Fig. 1). The IgG binding region consists of five homologous IgG binding domains which 
can be cleaved apart by trypsin treatment at protein level. The B-domain has been crystallized together with 
IgG and structure determined by X-ray crystallography (Deisenhofer. J., Biochemistry. 20, 2361 (1981) 

The five IgG binding domains consist of approximately 58 amino acids (E is shorter and D is longer) 
is and the amino acid sequences of the regions are shown in Rg. 2. 
The fragment that was synthezied had the following features: 

1) Lack of any Met; 

2) Lack of Asn-Gly dipeptide sequence; 

3) The fragment is potimlzed to be synthezied at nucleotide level to facilitate clonino to oet 
20 expression; y y 

4) The fragment can be polymerized at nucleotide level to get any number of IgG binding regions- 

5) The fragment is capable of being expressed in an genetic system adapted for expression and 
secretion. 

1) There are alltogether 3 Met in the IgG binding region of protein A. In the synthetic fragment the 
SS consensus Met (the protein A consensus amino acid sequence is shown In Rg. 2) was changed in the 
synthetic fragment by changing the Asn-Met to codons for a His-Leu sequence. 

2 > ™ e Asn-Gly dipeptide sequence is sensitive to hydroxylamine. As this sequence is kept intact in 
an five IgG binding domains of protein A and as this amino add sequence is present in the middle of an 
alpha helix involved in the binding to IgG <Fig. 2) there is very little chance to be successful in any amino 
acid change. The obvious choice to change the Asn to a Gin was analyzed by computer graphics (FROOO 
software. Alwin Jones, Biomedical Centre. Uppsala, Sweden) using the coordinates available from the 
Brookhaven Protein Data Bank (Bernstein. F.C. et al J.Mol.Biol.. 112. 553-542 (1972) calculated from the X- 
ray crystallographic structure of protein A. 

Since the Asn provides for hydrogen bonds to two other residues the change of the code for the Asn to Gin 
was expected to destroy both the terlary structure of protein A and the binding to IgG. Instead, however the 
computer analysis surprisingly showed that the Gly In the Asn-Gly dipeptide sequence could bo changed to 
an Ala. This change was not obvious as glycines are among the most conserved amino acids between 
homologous protein sequences due to their special features. The features include flexibility around the 
peptide bond, the usual function to start and break alpha and beta structures and the feature as an amino 
acid havmg no net charge In the polypeptide chain. However, by simulating the Gly to Ala amino acid 
change in the computer we concluded that this change would not interfere with folding to protein A or 
binding to IgG. 

3) The fragment was synthezied at ONA level in 10 separate oligomers. To facilitate cloning of the 
fragment one Hind.Hl site was designed In the 5' end and an EsgRl site In the 3' end. (The nucleotide 

45 sequence of the fragment Is shown in Rg. 3). 

4) In order to be able to polymerize the fragment at nucleotide level a non-palindromlc Accl site was 
introduced by changing the nucleotide sequence to GTAGAC in the code tor N-terminal end of the IgG 
binding region. In that way an Ala codon present In all regions is changed to a Val. By this nucleotide 
constellation the fragment can be polymerized to any multiplicity thus altering the binding capacity of the 

so translated product. It is not obvious If this amino acid substitution will Interfere with the protein A function or 
not* 

5) By introducing a F§£l site where the signal sequense shall be attached to the fragment and by 
starting the coded fragment with the six amino adds unique for the E-region of protein A the fragment is 
capable of being attached to the protein A signal sequence by molecular cloning and the six N-termlnal 

55 amino acids were introduced to ensure secretion of the fragment. 

Specific embodiments of the Invention will now be described in detail. 
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Starting materials 

Bacteria l hosts - Three different strains of E.coli K12 were used in the Examples: 
HB101 (Boyer, H.W. et ai J.Mol.Biol., £L. 459-572 (1969). JM 83 (Yanisch- Perron. C. et al Gene, 33, 103- 
119 (1985)) and JM 103 (Messing. J. et al Methods Enzymol., 1£h , 20-79 (1983)). (The strains are available 
at the Dept of Biochemistry and Biotechnology, Royal Insitute of Technology. Stockholm, Sweden). 

Cloning vehicles : The cloning vehicles used in Examples wereM13mpl8 (Yanisch-Perron, C. et al Gene, 
33, 103-119 (1985)) and pUC8 (Vieva, J. et al Gene 19, 259 (1982)). The vector pHL33 is a vector derived 
from PEMBL19 (-) (Dente et al, Nucl. Acids Res.. H . 1645 (1983)) and pRIT4 (Nilsson, B. et al. EMBO J. 4, 
1075 (1985)) constructed in the following way: 

The plasm id pRIT4 was cleaved with Tao I and after fill-in reaction with Klenow polymerase. Not- 1 linkers 
were added. The reaction mixture was cleaved with Eco RI Not I and the fragment spanning over the protein 
A gene was isolated. This fragment was cloned into pEMBL19 (-), where the Clal site previously had been 
linked to a Notl site, by cleaving that plasmid with Notl and EcoRI. This gives a vector containing a part of 
the protein A gene followed by the mp19 multirestriction enzyme linker. 

The vector pEX4-IGF-l is pEX (Stanley.K. et al EMBO J. 2, 1429 (1984) having the syntehtic IGF-I 
cloned in Eco RI to Bam HI'. 

The synthetic gene encoding IGF-I has been described by Elmblad. A. et al In Third European 
Congress on Biotechnology III, 287-296, Vertag Chemie, Weinheim (1984). The plasmid vector pASZ2 has 
been deposited with the Deutsche Sammiung von Mikroorganismen (DSM), Gdttingen. Federal Republic of 
Germany, under No. 3594 DSM and (the designated name pE*2 in the deposition document). 



Buffers and Media 

Coating buffer : 1.59 g Na,CO,. 2.93 g NaHCO, and 0.2 g NaN,, made up to 1 litre with distilled H,Q 
P§§I: 8.0 g NaCl, 0.2 g KH,P<X 2.9 g Na,HPO. X 12H,0. 0.2 g KCL, 0.2 ml Tween®20 and 0.2 g NaN, 
made up to 1 litre with distilled H t O (pH 7.4). 

TftB : 30 g Tryptic Soy Broth, made up to 1 litre and autoclaved. 

UJ: 10 g Bacto tryptone, 5 g Yeast extract and 10 g NaCl made up to 1 litre and autoclaved. 

l,A: Plates containing LB and 15 g Bacto Agar per litre. 

TBAB : 30 g Tryptic Blood Agar Base, made up to 1 litre and autoclaved. 

ONPG-buffer 2mM o-Nitrophenyl-^-D-galactoslde (ONPG, Sigma product No N-1127) in 0.1 M potas- 
sium phosphate buffer, pH 7.3, containing 15 mM 2-mercaptoethanol and imM MgCl a . 



Routine Methods 

Certain procedures were carried out repeatedly in the Examples. Unless otherwise specified, they were 
done exactly as follows each time they were carried out. 

Methods used routinely in molecular biology are not described (like the use of commercial restriction 
enzymes, DNA-ligattons, Bal 31 exonuclease, S1 nuclease and Klenow polymerase) 

Transformations: Transformation of E.coli K12 with plasmid DNA was performed exactly as described - 
(Morrison, D.A., Methods in Enzymology, Academic Press 68, 326-331 (1979)). The transformants were 
selected in a conventional manner on plates (TBAB) containing 70 mg/l ampicillin. 

Isolation of Plasmid DNA : Plasmid DNA was Isolated as described by Birnboim, H.C. et al. Nucl.Acids 
Res. Z. 1513 (1979). Small scale preparations to screen a large number of transformants were made exactly 
as described by Kieser, T. Plasmid 1£, 19-36 (1984). 

Seoharose §B chromatography : Plasmid DNA to be used for Bal31 or Si treatment were run on a 
Sepharose 6B gelfiltratlon in a 10mM Tris. ImM EDTA and 500 mM NaCI-buffer. In this way DNA is 
separated from RNA. 

Elution oj DNA fragments . Elution of DNA fragments from either agarose or polyacrylamide gel pieces 
were performed exactly as described by Maxam et al, P.N.A.S. (USA), 7± 560-564 (1977). 

ligation of DNA in Low Gel Temperature Agarose gel : Ligation directly in agarose gel was performed 
by running the electrophoresis in a Low Gel Temperature Agarose gel and after cutting out the band the gel 
piece was melted by heating to 65°C. After a 10 times dilution using Tris buffer (10 mM pH 7.4) ligation 
could be performed. 
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Detection and quantification 2! Pioiein A. An ELISA test (Enzyme linked immunosorbent assay) wa 
used to quantify protein A. The test makes use of a special microtiter plate (TUertek, Amstelstac 
Netherlands) having no net charge. The wells are coated with human IgG (Kabi AB. Sweden) in a coatlm 
buffer. Test samples are added and protein A Is bound to the Fc portions of the IgG adsorbed in the wel 
5 Protein A is then assayed by an anti-protein A (from rabbit) conjugated to /S-galactosidase (from Pharmaci 
AB, Uppsala, Sweden). 

Assay: The wells of a microtiterplate are filled with 75 ul of a solution of human IgG at 16 ng/ml h 
Coating Buffer and the plate Is incubated at room temperature for at least 1 hour. The wells are washer 
three times with 100 ul PBST and 50 ul of sample Is added to each well. For quantitative determination ; 
10 fold dilutions are made. After incubation for 1 hour the wells are washed 3 times with 100 ul PBST followec 
by addition of 50 ul anti-protein A-0-galactosidase (the amount of protein A binding capacity added to eacf 
well corresponds to the molar amount of IgG added to each well as detected by titration with protein A lr 
excess). After incubation for 45 minutes, the wells were washed 3 times with 100 ul PBST followed b\ 
addition of 125 ul ONPG buffer. After incubation for 20-30 minutes 150 ul 0.1M NaOH was added to stoj 
15 the reaction. The quantification is made by running a 2-fold dilution of a protein A standard solution o 
known concentration in parallel with the 2-fold dilutions of the test samples. The absorbance at 405 nm i: 
measured for each well by a fotometer. 

?D$-PAQE: SDS-polyacrylamide gel electrophoresis was performed exactly as described by Laemmfi 
O.K. Nature (London), 227, 680-685 (1970) using a 10-20 % step gradient gel. 
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EXAMPLES 

I Construction of a synthetic protein A fragment (Z) 



The derived DNA sequence was analysed by a computer program and divided into 10 oligonucleotides 
varying in length from 41 to 45 nucleotides and with an overlap of 6 bp. 

Synthesis was effected on a fully automated machine and the deprotected oligomers were purified b> 
polyacrylamid electrophoresis (20 % polyacrylamide. 7 M Urea. 50 mM Tris-borate pH 8.3) followed by 
90 extraction into water and lyophilization. 

II Ligation ana* cloning of Z-fraoment 

35 100 pmoles of oligonucleotides A1-A5 and B1-B5 were phosphorylated separately in 20 ul Kinase 

buffer (50 mM Tris-HCI pH 7.6, 10 mM MgCI„ 1 mM ATP. 10 mM OTT) 5 units of polynucleotide kinase 
was added and the mixtures were incubated for 45 minutes at 37 °C. 

5 ug of the vector Ml3mp18, replicative form, was digested with the restriction enzymes EcoRI and 
Hincp. The large fragment from the digestion was isolated from a low temperature agarose geL 

40 The agarose containing digested M13 mp18 was melted at 65 *C and 5 ul (0.5 ul. 0.1 p mole) was 
mixed with 0,5 p mole each of the phosphorylated oligomers A1-A5 and B1-B5 in 50 ul ligation buffer (66 
mM Trls-HCI pH 7.6 50 mM MgCI* 50 mM DTT, 1 mM ATP) heated to 90°C and slowly cooled to room 
temperature during one hour. 10 units of T4 DNA ligase was added and the mixture was incubated 
overnight at 15*C. 

E-coJI JM 103 was transfected with the DNA thus obtained and grown overnight on 2 x YT plates 
containing x-gal and IPTG. 78 white plaques were transferred to a new 2 x YT plate and grown as colonies. 
Colonyhybridizatlon with »P labelled oligonucleotide B5 as probe gave one positive colony which was 
picked and grown in 15 ml 2 x YT with g.cpji JM 103. The cells were spun down and the phages were 
recovered from the supernatant. The single stranded phage DNA was extracted and purified and was used 
so as template for sequencing reaction according to the dideoxy method. The Mi3mp18 containing the 2- 
fragment was designated M13-2. 
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III Construction of the olasmid vector pASZ2 

The plasmid vector PHL33 was cleaved with Hindlll and EcoRI. The large fragment was Isolated from a 
1 % agarose gel after electrophoresis. The fragment was ligated to an isolated Z-fragment (Hindlll/EcoRI). 
s After ligation the mixture was transformed to g.coJi HB101 and the isolated vector PASZ1 (Fig. 7) has the 
synthetic Z-fragment cloned down stream from the signal sequence. In order to place the synthetic 
fragment directly after the signal sequence the pASZ1 vector was cleaved with FspJ. After religation the 
mixture was transformed into E.coli HB101 and pASZ2 could be isolated. 

The vector has the synthetic Z-fragment (Fig. 3) directly after the signal sequence of protein A. 
10 The construction was confirmed by DNA sequencing. 

IV Expression gl PASZ2 

1 5 Expression of pASZ2 in E.coli HB101 . 

The strain was inoculated to 15 ml TSB and after incubation in a shake flask for 12 h the cell 
suspension was centrifuged. 

The cells were washed once In TE (5 ml) and was further resuspended in 5 ml TSB followed by 
sonication for 3 x 30 s (MSE sonicator, microtip. power 6). After sonication the mixture was centrifuged 10 
20 000 xg for 10 minutes. 

The growth media and the extract from the cell sonication was assayed for protein A using the method 
described In Routine Methods. The results are shown in Table 1: 
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Plasmid Filamentous growth Expression % extra- 

• level (mg/1) cellular 

pASZ2 + 3.6 72 

These results show that the fragment is IgG binding. 

V Construction of dim eric Z fragment 

The single stranded DNA from M13-Z was annealed to sequencing primer followed by treatment with 
Klenow and dNTPs. 

In this was double stranded DNA obtained. The reaction mixture was digested with Hind lll and EcoRI 
and the Z fragment was Isolated from a Low Gel Temperature Agarose gel electrophoresis. Separately the 
plasmid pUC 8 was digested with §cc;RI and Hind lll and the large fragment was isolated from Low Gel 
Temperature Agarose gel electrophoresis. The two isolated gel fragments were heated to 65°C and 2.U.I of 
each melted fragment were ligated in 50 ul ligation buffer and 2.5 units DNA ligase at 15*C overnight. The 
DNA thus obtained was used to transform £ .goJi strain JM83 as above, and plasmid DNA from two 
transformants was isolated and proved to contain the Z fragment. The pUC 8-Z was digested with restriction 
enzyme AccL After religation of the digestion mixture a new transformation of JM83 was made, followed by 
plasmid isolation from 12 transformants. Digestion with Hind lll and EcgRI and analysis on agarose gel 
confirmed that 2 transformants carried the pUC 8 with an insert of two Z fragments (pUC 8-ZZ). 

VI Construction pi exoressionvector pZZ-lGF-l 

The construction of pZZ-IGF-l was made in the following way (Rg. 6): 

A. pUClB-ZZ was digested with Fsp I and EcoRI and the smallest fragment was isolated on LGT 
agarose. 

B. The plasmid vector pHL33 was digested with FspL The Jargest fragment (2273 bp) was isolated on 
LGT agarose. 

C. The plasmid pEX4~»GF-l was digested with Fsfil and EcoRI. The small fragment spanning over the 
IGF-I gene into the AMP gene was isolated. 



7 



0 230 869 



The three fragments from A, B and C were ligated together as described in Routing Methods and th« 
ligation mixture was transformed into EqoH JM83. 

Transformant selection was conducted using a LB agar medium containing 70 ag/ml of ampicillir 
Isolation of the plasmid DNA and analysis with restriction enzymes confirmed that the transformants carrie< 
s the plasmid pZZ-IGF-l. 



VII Growth of E.coli JM83 containing pZZ-IGF-l and purification of 1GF-I 

io 4 I of growth medium LB supplemented with 02 % glucose. 0.01 M MgSO* and Ampicillin (100 ug/ml 
was inoculated with E.coji JM83/pZZ-IGF-l. The cells were grown in shake flasks for 20 h at 37'C in 2 
flasks with 500 ml broth in each flask. The cells were spun down and kept at -20°C for 3 days. To tht 
frozen pellet (32 g net weight) 128 ml 10 mM tris HCI pH 8.0 solution was added and the cellsuspensior 
was gently mixed for 2 h at + 4°C. The cells were spun down by centrifugation and the supernatant wa: 

is recovered for affinity chromatography. The supernatant was recentrifuged before applied to an IgC 
sepharose column with 150 mM NaCI 50 mM tris HCI pH 7.5 (hereinafter TS). 

The supernatant was passed through the column at a speed of 12 ml/h and the amount of IgG binding 
material was analyzed before and after run through the column. The bound material was washed with Ti 
supplemented with 0.05 % Triton X-100 and then TS and finally with 0.05 M ammonium acetate before 

20 elution with 1 M acetic acid pH adjusted to 2.8 with ammonium acetate. The 2 ml fractions were assayed fo 
protein A content and the fractions were pooled and freeze dried. The hybrid protein was cleaved with 
hydrdxyl amine as described by Bornsteln, et al, Methods Enzymol.. 47. 132 (1977). The dry material wai 
dissolved in 2M hydroxylamine adjusted to pH 9 with Lithium chloride and 0.2 M Tris base. The cleavage 
was performed at 45 °C for '4 h. The cleavage reaction was stopped by lowering the pH to 7.0 with acetic 

25 acid and the material was desalted on a PD-10 column saturated with TS. The desalted fraction of 3.5 m 
was passed through a 1 ml IgG-sepharose column to separate IGF-I from the ZZ polypeptide. This materia 
was then desalted on a PD-10 column saturated with 0.2 M acetic acid, and then freeze dried. The materia 
was then analyzed on sodium dodecyle sulphate poly aery lam ide gel electrophores. This process is showr 
in Fig. 8. and SDS-polyacrylamide gel is shown in Fig. 9. The nucleotide sequence of the expressed fusior 

oo protein is shown in Fig. 4. 

These results shown that the ZZ fragment is functional in a process to purify IGF-I in respect to binding 
IgG to facilitate purification as well as harbouring a unique Asn-Gly cleavage site to release native IGF-I anc 
to enable the ZZ portion to bind to the IgG gel after a second passage through the gel. 

Although the invention has been exempliefied in regard to purification of IGF-I it is to be noted that the 

as invention is applicable to purification of any gene coding for a useful protein. Thus. IGF-l-purification is only 
one example of the usefulness of the invention and is not to be construed to limit the scope of the invention 
otherwise than as defined in the appended patent claims. 

The Asp-Pro dlpeptide constellation present in all protein A domains known to bind human IgG (Rg. 2) 
was designed to be changed to a Glu-Pro. This will make the normally acid labile Asp-Pro peptide bound 

40 resistant to acid treatment (like 70 % Formic Acid at 42° for 12 h) making any other introduced Asp-Pro 
coding linker unique. This designed amino acid change (Asp to Glu) is not shown In Examples. 



Claims 

1. A recombinant DNA fragment (Z) coding for an immunoglobulin G binding domain related to 
staphylococcal protein A, characterized In that the methionine codon of said fragment has been replaced by 
a codon of another amino acid residue enabling expression of a methioninefree protein. 

2. The fragment of claim 1 , wherein the codon of said another amino acid residue is that of leucine. 

3. The fragment of claim 1 or 2. wherein the asparagine-methionine codons have been replaced by 
histidine-leucine codons. 

4. The fragment of claim 1 , 2 or 3. wherein the codon of amino acid residue number one as defined by 
trypsin digestion of native protein A has been replaced by a valine codon so as to give at the nucleotide 
level the sequence GTAGAC furnishing a non-paRndromic Acc I site. 

5. The fragment of claim 1. 2. 3 or 4. wherein the glycine codon has been replaced by an alanine 
codon. 

6. The fragment of any preceding claim, wherein the -Asp-Pro codons have been modified to Increase 
the acid stability of the peptide bond of the expressed protein. 
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7. The fragment of claim 6, wherein the aspartic acid codon has been replaced by a glutamic acid 
codon. 

8, A recombinant DNA sequence comprising at least two Z-fragments as defined in any preceding 
claim. 

s 9. The sequence of claim 8. wherein the number of Z-fragments is within the range 2 to 15. 

10. A recombinant DNA fragment coding for any of the E D A B C domains of staphylococcal protein A 
or functional equivalents thereof, wherein the glycine codon<s) in the Asn-Gly coding constellation has been 
replaced by an alanine codon. 

11. A recombinant DNA sequence comprising the Z-fragment of any of claims 1-7 preceded by a signal 
io sequence and a nucleotide sequence coding for the amino acid sequence: 

Ala Gin His Asp Glu Ala. 

12. A recombinant DNA sequence comprising the sequence of claim 8 or 9 preceded by a signal 
sequence followed by a nucleotide sequence coding for the amino acids: 

Ala Gin His Asp Glu Ala. 

?5 13. A recombinant DNA molecule comprising the recombinant DNA sequence of claim 11 or 12 and 

fused 3' thereof at DNA level a production gene, said molecule having the ability to express a fused protein. 

14. The molecule of claim 13, wherein said production gene is that of a somatomedin. 

15. The molecule of claim 14, wherein said production gene is that of IGF-1. 

16. The molecule of claim 15, wherein the N-terminal glycine codon Is preceded by an asparagine 
20 codon to enable hydroxyl amino cleavage of the peptide bond to release native IGF-1 . 

17. The molecule of claim 14, wherein the production gene is that of IGF-2. 

18. The molecule of claim 17, wherein the N-terminal glycine codon is preceded by a methionine codon 
to enable cyanogen bromide cleavage of the peptide bond to release native IGF-2. 

19. A process for cleaving a fused protein expressed in a biological system by the recombinant DNA 
25 molecule of any of claims 16 to 18. 

20. The process according to claim 19 in its dependence on claim 17 or 18, wherein the cleavage is 
performed by CNBr-treatment. 

21. The process according to claim 19 in its dependence on claim 14 or 15, wherein the cleavage is 
performed by hydroxy l-amine treatment. 

30 22. A plasmid vector comprising the recombinant DNA molecule of any of claims 13-18. 

23. A bacterial cell harbouring the recombinant DNA molecule of any of claims 13-18. 

24. A bacterial cell according to claim 23 harbour ing said molecule In its chromosome. 

25. A bacterial cell according to claim 23 harbouring the plasmid vector according to claim 22. 

26. A bacterial cell according to any of claims 23-25. which is Gram negative, 
as 27. A bacterial cell according to claim 26, which is an E.coli. 

28. Proteins obtained by the process of any of claims 19^21. 
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|AlaGlnHisAspGluAla.Vc 
' ^-Cleavage [_ 



AAGCTTCCGCGGCAAATGCTGCGCAACACGATGAAGCCGTAGACAACAAATTCAACAAA 

Tal AspAsnLysPheAsnLy s 

^Z-region 
region 

60 70 80 90 ; 110 110 

GAACAACAAAACGCGTTCTATGAGATCTTACATTTACCTAACTTAAACGAAGAACAACGA 

GluGlnGlnAsnAlaPheTyrGluIleLeuHisLeuProAsnLeuAsnGluGluGlnArg 

120 130 140 150 160 170 

AACGCCTTCATCCAAAGTTTAAAAGATGACCCAAGCCAAAGCGCTAACCTTTTAGCAGAA 

Asn Ala PhelleGlnSerLeuLvsAspAspProSerGlnSerAlaAsnLeuLeuAlaGlu 



180 190 200 210AccI 220EcoRI 

GCTAAAAAGCTAAATGATGCTCAGGCGCCGAAAGTAGACGCGAATTC 

AlaLysLysLeuAsnAspAlaGlnAlaProLy^alAspAlaAsn 
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ZZ-IGF-1 894 BASES 

1 11 c / 21 31 41 51 

TCGAAATAGCGTGATTTTGCGGTTTTAAGCCTTTTACTTCCTGAATAAATCTTTCAGCAA 

61 71 81 91 101 111 

AAT/.TTTATTTTATAAGTTGTAAAACTTACCTTTAAATTTAATTATAAATATAGATTTTA 



121 131 141 151 161 171 

GTATTGCAATACATAATTCGTTATATTATGATGACTTTACAAATACATACAGGGGGTATT 

181 191 201 211 221 231 

AATTTGAAAAAGAAAAACATTTATTCAATTCGTAAACTAGGTGTAGGTATTGCATCTGTA 
LeuLysLysLysAsnlleTyrSerlleArgLysLeuGlyValGlylleAlaSerVal 

241 251 261 271 281 2.91 

ACTTTAGGTACATTACTTATATCTGGTGGCGTAACACCTGCTGCAAATGCTGCGCAACAC 

ThrLeuGlyThrLeuLeuIleSerGlyGlyValThrProAlaAlaAsnAlaAlaGlnHis 

The signal sequence Cleavage region 

301 311 AccI 321 331 341 351 

GATGAAGCCGTAGACAACAAATTCAACAAAGAACAACAAAACGCGTTCTATGAGATCTTA 
AspGluAlaValAspAsnLysPheAsnLysGluGlnGlnAsnAlaPheTyrGluIleLeu 
Z-region 

361 371 381 391 401 411 

CATTTACCTAACTTAAACGAAGAACAACGAAACGCCTTCATCCAAAGTTTAAAAGATGAC 

HisLeuProAsnLeuAsnGluGluGlnArgAsnAlaPhelleGlnSerLeuLysAspAsp 

421 431 441 451 461 471 

CCAAGCCAAAGCGCTAACCTTTTAGCAGAAGCTAAAAAGCTAAATGATGCTCAGGCGCCG 

ProSerGlnSerAlaAsnLeuLeuAlaGluAlaLysLysLeuAsnAspAlaGlnAlaPro 

481 AccI 491 501 511 521 531 

AAAGTAGACAACAAATTCAACAAAGAACAACAAAACGCGTTCTATGAGATCTTACATTTA 
LysValAspAsnLysPheAsnLysGluGlnGlnAsnAlaPheTYrGluIleLeuHisLeu 
Z-region 

541 551 561 571 581 591 

CCTAACTTAAACGAAGAACAACGAAACGCCTTCATCCAAAGTTTAAAAGATGACCCAAGC 

ProAsnLeuAsnGluGluGlnArgAsnAlaPhelleGlnSerLeuLysAspAspProSer 

601 611 621 631 641 651 

CAAAGCGCTAACCTTTTAGCAGAAGCTAAAAAGCTAAATGATGCTCAGGCGCCGAAAGTA 

GlnSerAlaAsnLeuLeuAlaGluAlaLysLysLeuAsnAspAlaGlnAlaProLysVal 

661. EcoR671 681 691 701 711 

GACGCGAATTCTAACGGTCCCGAAACTCTGTGCGGTGCTGAACTGGTTGACGCTCTGCAG 
AspAlaAsnSerAsnGlyProGluThrLeuCysGlyAlaGluLeuValAspAlaLeuGln 

IGF-1 

721 731 741 751 761 771 

TTTGTTTGCGGTGACCGTGGTTTTTATTTTAACAAACCCACTGGTTATGGTTCTTCTTCT 

PheValCysGlyAspArgGlyPheTYrPheAsnLysProThrGlyTyrGlySerSerSer 

781 791 801 811 821 831 

CGTCGTGCTCCCCAGACTGGTATTGTTGACGAATGCTGCTTTCGTTCTTGCGACCTGCGT 

ArgArgAlaProGlnThrGlylleValAspGluCysCysPheArgSerCysAspLeuArg 

841 851 861 871 881 891HindIII 

CGTCTGGAAATGTATTGCGCTCCCCTGAAACCCGCTAAATCTGCTTAGAAGCTT 

ArgLeuGluMetTyrCysAlaProLeuLysProAlaLysSerAla*** 
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